Active Memory Cube: A processing-in-memory architecture for exascale systems

نویسندگان

  • Ravi Nair
  • Samuel Antão
  • Carlo Bertolli
  • Pradip Bose
  • José R. Brunheroto
  • Tong Chen
  • Chen-Yong Cher
  • Carlos H. A. Costa
  • J. Doi
  • Constantinos Evangelinos
  • Bruce M. Fleischer
  • Thomas W. Fox
  • Diego S. Gallo
  • Leopold Grinberg
  • John A. Gunnels
  • Arpith C. Jacob
  • P. Jacob
  • Hans M. Jacobson
  • Tejas Karkhanis
  • C. Kim
  • Jaime H. Moreno
  • John K. O'Brien
  • Martin Ohmacht
  • Yoonho Park
  • Daniel A. Prener
  • Bryan S. Rosenburg
  • Kyung Dong Ryu
  • Olivier Sallenave
  • M. J. Serrano
  • P. D. M. Siegl
  • Krishnan Sugavanam
  • Zehra Sura
چکیده

A processing-in-memory architecture for exascale systems R. Nair S. F. Antao C. Bertolli P. Bose J. R. Brunheroto T. Chen C.-Y. Cher C. H. A. Costa J. Doi C. Evangelinos B. M. Fleischer T. W. Fox D. S. Gallo L. Grinberg J. A. Gunnels A. C. Jacob P. Jacob H. M. Jacobson T. Karkhanis C. Kim J. H. Moreno J. K. O’Brien M. Ohmacht Y. Park D. A. Prener B. S. Rosenburg K. D. Ryu O. Sallenave M. J. Serrano P. D. M. Siegl K. Sugavanam Z. Sura Many studies point to the difficulty of scaling existing computer architectures to meet the needs of an exascale system (i.e., capable of executing 10 floating-point operations per second), consuming no more than 20 MW in power, by around the year 2020. This paper outlines a new architecture, the Active Memory Cube, which reduces the energy of computation significantly by performing computation in the memory module, rather than moving data through large memory hierarchies to the processor core. The architecture leverages a commercially demonstrated 3D memory stack called the Hybrid Memory Cube, placing sophisticated computational elements on the logic layer below its stack of dynamic random-access memory (DRAM) dies. The paper also describes an Active Memory Cube tuned to the requirements of a scientific exascale system. The computational elements have a vector architecture and are capable of performing a comprehensive set of floating-point and integer instructions, predicated operations, and gather-scatter accesses across memory in the Cube. The paper outlines the software infrastructure used to develop applications and to evaluate the architecture, and describes results of experiments on application kernels, along with performance and power projections.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study of Numerical Processing Speed, Implicit and Explicit Memory, Active and Passive Memory, Conservation Abilities, and Visual-Spatial Skills of Students with Dyscalculia

Background and Purpose: Learning disorder is one of the common disorders in students, which can lead to the occurrence of educational problems and secondary disorders in them. Based on psychopathological criteria, dyscalculia is one of the subcategories of learning disorder. Children with this disorder have problems in perception of spatial relations and in different cognitive abilities. Theref...

متن کامل

The comparison of active memory and sensory processing styles in boys and girls children with writing learning disorder

Introduction: The main problems of children with learning disorder are in cognition and their sensations. The present study aimed to investigate the comparison of active memory and sensory processing styles in boys and girls with writing learning disorder. Methods: The methodology of this descriptive study was the Control-case study type. The statistical population of this study was all girls ...

متن کامل

The Effects of Active Memory Exercises on Intelligence Profile in Students With Specific Learning Disorder

Background: Active memory is the search engine of the mind. Active memory is a cognitive function responsible for preserving instant information, its manipulation, and its use in thinking. This study aimed at investigating the effects of active memory practices on intelligence profiles in students with Specific Learning Disorder (SLD). Methods: This was a quasi-experimental study with a prete...

متن کامل

Toward a Memory-Centric, Stacked Architecture for Extreme-Scale, Data-Intensive Computing

One of the primary concerns of performing efficient data-intensive computing at scale is the inherent ability to exploit memory bandwidth on a local and global scale. The traditional computer architecture inherently decouples the processing interconnect from the memory interconnect, thus preventing efficient, parallel utilization of both at scale. Further, the orthogonal nature of these board-l...

متن کامل

Effect of Rhythmic Movements on working Memory, Motor Proficiency and Writing Skills in the Students with Dysgraphia

Introduction: One of the most common abnormalities of learning is dysgraphia, which refers to a serious defect in mechanical writing skills. Children with dysgraphia may not be able to perform the actions required to write or transfer information within the hearing or vision to exercise and poorly performing in cognitive skills such as organization, attention and memory. Evidence suggests that ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IBM Journal of Research and Development

دوره 59  شماره 

صفحات  -

تاریخ انتشار 2015